1. IntroductionThe theory of topological states has been greatly developed since the discovery of the quantum Hall effect by Klitzing[1,2] 40 years ago. By now physicists have proposed a variety of novel topological states, including topological insulator,[3–8] topological semimetal,[9,10] topological superconductor,[11–14] and axion insulator.[15,16] The nontrivial boundary or surface states of these systems give topological states the non-dissipated channel and other transport properties that traditional materials do not have. In the past two years, the higher-order topological states[17–22] have received a lot of attention. Different from the traditional topological insulator, the higher-order topological insulator has a d – n dimensional topological surface state, where d is the dimension of the system and n is the order of the higher-order topological state. When n = 1, it corresponds to the general topological state. For n > 1, it is generally referred as the higher-order topological state. Taking two dimensions (2D) as an example, the traditional 2D system has a band gap without elementary excitation. On the contrary, in the higher-order systems, the spectrum of the edge state needs to be opened[17–19] and 0D topological corner states are generated inside the band gap. The initial models usually have the mirror or quadruple rotation symmetry, which ensures that the edges (surfaces) of the samples have a mass domain wall[23] structure. With the proposal of the initial theoretical model,[24] the topological corner state was also confirmed in experiments.[25] Subsequently, a series of experiments also proved the diversity of higher-topological states.[23,26–28]
With the development of topological materials in condensed matter physics, the disorder effects on topological states attract a lot of research interest.[29–43] The conventional numerical characterization methods include transport simulation,[29,35,44] local current distribution simulation,[30,44] and topological index calculation.[34,37,40,41] However, these methods all have limitations in the study of the higher-order topological insulators. For example, the quantized conductance without fluctuation and the local current distribution can demonstrate the existence of edge (surface) states as well as the topological properties of a topological insulator. However, due to the presence of a 0D bound state, the topological properties of the second-order topological insulator system cannot be characterized by the transport simulation and local current distribution. Although a one-dimensional system also presents the 0D bound state, one can calculate the winding number[37,41] to separate the trivial state from the topological one. The winding number, however, cannot be directly used in a higher-order topological system with 0D bound states. Thus, there exsits an urgent need to explore new methods to study the disorder effect on the higher-order topological states.
In the past few years, machine learning has gained great development and wide applications, especially in the field of image classification. Image classification, which has sufficiently high accuracy[45] based on the convolutional neural network, is also used in the research of condensed matter physics.[46–54] Since the wave function distributions for different topological phases are distinguished, machine learning should be useful to determine the 0D bound states, which will make up for the shortcomings of the traditional methods. Most recently, Araki and his collaborators firstly studied the disorder induced phase transition[55] between the higher-order topological insulators, metal, and insulators by using the convolutional neural network method. However, no attention has been paid to whether there is a phase transition between the higher-order topological states and other topological phases.
In this paper, we study the phase transition of magnetic higher-order topological insulators in a disordered environment via machine learning. Combined with the non-commutative geometric method, we compare the similarities and differences of the phase diagrams. The stability of the higher-order topological insulators under weak disorder is obtained and our results are consistent with the previous study.[55] Furthermore, the phase diagram calculated by the convolutional neural network presents the phase transition from the normal insulator to the Chern insulator and from the higher-order topological insulator to the Chern insulator. By comparing the Chern number distribution, we find that the phase transition from the normal insulator to the Chern insulator does exist, which is consistent with the results of deep learning. However, there exists no disorder induced transition from the higher-order topological insulator to the Chern insulator. This result indicates the limitation of deep learning as well.
2. Higher-order topological insulatorCompared with the traditional topological insulators, higher-order topological states have edge (bound) states of (d – n) dimensions protected by the topological symmetry, where d is the dimension of the system and n is the order of the second-order topological states. Taking a two-dimensional topological insulator as an example,[4] its Hamiltonian can be derived from a three-dimensional topological insulator and written as
where
i =
x,
y and
τ,
σ correspond to Pauli matrices on orbital and spin spaces, respectively,
m is the mass term, and
Hw represents the Anderson disorder. When
λ = 0, ±(
m +
tcos
kx +
tcos
ky) appears accidental degeneracy and the cross points are not located at the high symmetric point. The degeneracy in the energy spectrum will be removed by the coupling term for
λ ≠ 0. Then, a non-trivial topological insulator with a one-dimensional gapless topological edge state will be obtained. Firstly, in order to realize the second-order topological state, the edge state should be gapped with a newly introduced time-reversal-symmetry-broken term
[17]
when
kx ≠
ky,
HΔ is not zero. The time reversal symmetry of the system is destroyed, and the edge state is gapped, which causes a band gap at the edge states. With the evolution of
kx,
ky, the sign of
HΔ will be reversed, which means that the mass domain wall will appear on the limited sample edge. Thus, a bound state forms at the corner of the sample and it is a second-order topological insulating state with
n = 2. Then, in order to study the change of the magnetic second-order topological state, a zeeman splitting term induced by magnetization is introduced
[22]
where
B is the magnetic field. Now, the phase diagram varying with
B and the mass term
m in a clean sample can be obtained. According to the band gap closure conditions at four high symmetric points
[22] (0,0), (0,
π), (
π,0), (
π,
π), the phase boundaries satisfy the following relation:
Other parameters are set as
t = 1,
Δ =
t/4,
λ =
t, so the phase boundary can be determined. As shown in Fig.
1, when
B = 0, there exist two different phases corresponding to the second-order topological insulators (|
m| < 2
t) and the normal insulators (|
m| > 2
t).
The zeeman term induces band gap closure and introduces a third phase: Chern insulator (CI). In order to obtain the eigenstates and eigenvalues under the finite size condition, the discretization method with cos ka = (eika + e−ika)/2 and sin ka = (eika − e−ika)/(2i) is used, where a is the lattice constant. We set a = 1 for simplicity. Then, the Hamiltonian can be rewritten in a tight-binding form as
Here,
ai (
) represent the four annihilation (creation) operators of point
i, which has four orbital degrees of freedom. The on-site potential matrix is
T0 =
mτzσ0 +
Bτ0σz +
εiτ0σ0, and
εi represents the Anderson disorder, which randomly distributes in [−
W/2,
W/2]. The
x-direction and
y-direction hopping matrices are
and
, respectively.
We choose the position of the blue dotted line shown in Fig. 1 (m = t) to get the phase transition of the second-order topological states. In order to confirm the phases with different magnetic field B, the band structure of a nanoribbon along x-axis is plotted in Fig. 2 [see Figs. 2(a)–2(c)]. Although the edge states still exist for B = 0, the curves are gapped. This is because the HΔ term breaks the time reversal symmetry and makes the topological insulator become a second-order topological insulator. When the magnetic field increases, the band gap of the edge states closes when B = 2t. Subsequent calculations indicate that the Chern number at this location is quantized [Chern number = 1, see Fig. 6]. As the magnetic field continues to increase to B = 4t in Fig. 2(c), the edge state finally disappears while the bulk gap still exists, which corresponds to the normal insulator. In order to examine the existence of the second-order topological states, we also calculate the eigenvalue evolution with the variation of magnetic field B for a 32 × 32 square sample in Fig. 2(d). When |m| < 0.7t, zero energy states appear [see Fig. 2(d)]. Due to the finite size effect, the range of zero energy states is less than the theoretical prediction |m| < t. In a word, these results are consistent with the phase diagram shown in Fig. 1.
Previous research[55] mostly focused on on the stability of the second-order topological states with disorder and the transition between the metal and insulator phases. In the next section, we investigate whether the second-order topological insulator can transit into the Chern insulator by a machine learning method.
3. Phase diagram calculated by machine learning methodIn this section, we briefly introduce the structure of the convolutional neural network adopted in our work. Since there exist only three cases [see Fig. 4] for clean samples, Lenet-5 convolutional neural network framework[45] is adopted. We construct the neural network with the help of Tensorflow. The entire neural network is composed of two convolution layers, two full connection layers, and an output layer. In addition to the output layer, we select Relu as the activation function. Since the wave function is a grayscale graph without RGB value, we use four convolution kernels with the kernel size 5 × 5 × 1 (5 × 5 corresponds to the width and 1 indicates the depth) at the beginning. The output of the convolution layer is pooled immediately afterwards. If not specified, the convolution step size defaults to 1 and the pooling kernel size defaults to 2 with pooling step set to 2. On the second layer, we select eight convolution kernels with kernel size 5 × 5 × 4 instead. Batch normalization is introduced after pooling to adjust the data structures. The results of the convolution layer are input into the full connection layer. We choose two full connection layers by using 80 and 32 neurons, respectively. The output layer has 3 outputs and the activation function is softmax. The outputs of the final layer are the normalized probability and the corresponding labels of the maximum probability, in other words, the Chern insulator, the second-order topological insulator, and the normal insulator. First, we select the blue line position as shown in Fig. 1 to generate the training and test data. B distributes in [−4t, 4t]. Then, 20000 groups of B are randomly selected and the corresponding wave functions are calculated. Subsequently, different labels are attached according to the value of B (|B| < t, corresponding to the second-order topological insulator; t ≤ |B| < 3t, corresponding to the Chern insulator; otherwise, considered as the normal insulator). Figures 3(a)–3(c) show the wave function distributions of the Chern insulator, the normal insulator, and the second-order topological insulator, respectively. For the normal insulators, the density of state is mainly concentrated in the center of the square sample as shown in Fig. 3(b). While the the Chern insulator [Fig. 3(a)] has an edge state, where the wave function is distributed in the interface with vacuum. For the second-order topological insulators, because of the corner states in the band gap of the edge states [Fig. 3(c)], the spectrum of this state is closer to the Fermi energy than that of the edge state. This means we can distinguish these different phases by the wave function distributions, which is also an important basis for deep learning classification. We choose the wave function of the eigenvalue (below and closest to the Fermi surface) as the final input wave function. 20000 sets of data are divided into 15000 sets of training data and 5000 sets of testing data. After 100 rounds of training, we confirm that the accuracy of the network has reached 99.99% and stop the training. Then, within the same range of [−4t, 4t] for B, the disordered W is gradually increased and the disordered wave function is generated. Afterwards, the disordered wave functions (after being averaged 40 times) are imported into the trained network. Then, the final result is obtained.
Figure 5 shows the output of the trained neural network by using the disordered wave function as the input, where the vertical axis is the magnetic field B and the horizontal axis is the disorder strength W. Since the wave function under strong disorder may produce a completely different distribution from the training samples, we only select the disorder strength of 0 ≤ W ≤ 5t.
Figures 5(a) and 5(b) show the maximum probability of the output of the neural network and the corresponding phase of the maximum probability, respectively. Firstly, the probability distribution of most areas is higher than 99%, which indicates that the network training is consistent with expectation. Furthermore, the probability of the phase boundary is about 50%, which is significantly lower than the probability on its both sides. This is due to the fact that the phase boundary may have the features of two different phases, so the network cannot give a high probability for either of the two phases.Secondly, we pay attention to the phase diagram in Fig. 5(b). When B = 0, the topological state of second-order is not affected by the disorder strength as large as W = 3.5t. However, with the increase of magnetic field B, the stability of the topological state is slightly weakened. This system also exists a topological Anderson transition induced by disorder, which indicates a phase transition from the normal insulator to the Chern insulator [see the red line in Fig. 5(a)]. Moreover, the network suggests that there exists a phase transition from the second-order topological insulator to the Chern insulator. Since our training process only contains three phases (two of them are the topological nontrivial phases), a comparison between Fig. 5 and other results is also needed to determine whether the judgment of the two phase transitions (the normal insulator to the Chern insulator and the second-order insulator to the Chern insulator) is correct or not.
4. Chern-number analysisIn this part, we will use the non-commutative geometry method to calculate the Chern number of disordered samples. The difference of phase diagram obtained by non-commutative geometry and deep learning method is also be investigated.
We firstly give a brief review of the Chern number calculation based on non-commutative geometric method.[56–61] According to the perturbation theory, the Chern number of an infinite system can be expressed as follows:
where
P(
k) corresponds to projection operators for all occupied states and
kx,
ky present the momentum of
x and
y direction, respectively. Consider
kx,
ky as discrete ones. The Brillouin zone is divided into
N ×
N grid and
kn = (
n1Δ,
n2Δ) with
n1,2 = 1,…,
N and
Δ = 2
π/
N. Since the discretization makes the brillouin zone discontinuous, the integration can be expressed as
In the whole calculation process, the most important step is to calculate
∂kiPk. The analytic solution in the continuous momentum space can be easily obtained with the help of momentum(
kx,
ky) dependent eigenvalue. For the discrete momentum space,
∂kiPk should be calculated according to the definition of differentiation
where
Δ1 = (
Δ,0),
Δ2 = (0,
Δ), and
Q can take arbitrary value between 0 and
N/2. For infinite system, the space projection operator is the continuous function in momentum space and should satisfy the following form
Pk = ∑
nεT2bne
ikn. Therefore, we only need to find the approximation form of e
ikx in differential form, which can be written as
cm is a vector
c = (
c1,
c2,…,
cQ). Then, we perform Taylor expansion on sin(
mΔxi) to the order of
δ2Q, where
x1 (
x2) correspond to
x, (
y) direction. Thus,
c must satisfy the following equation in order to let
:
where
Aij =
j2i − 1 with
i,
j = 1,2,…
Q. Now, the expansion coefficient
cm is obtained and (
∂ki −
δki)e
ikx = 0 is satisfied to
Δ2Q order. These results are determined by (
kx,
ky), but the Chern number for dirty samples is based on a discrete lattice model with finite size. In real space, we have
δkiPk = −i [
xi,
P] after turning i
∂kx (i
∂ky) into
x(
y). Then,
δkiPk can be rewritten as
Therefore, the Chern number in real space can be rewritten as
where
P is the projection operator of occupied states on the basis of real space and |
n,
α〉 is the real space coordinates. When the real space coordinate
x is acting on |
n,
α〉, the corresponding space coordinates will be obtained. The index theory shows that the Chern number should be quantized as long as the Fermi surface locates in the bulk gap (not caused by the discretization momentum because of finite size effect). Futhermore, the quantized Chern number even varies with the existence of the bulk gap.
Based on the real space calculation, we obtain both the Chern number for clean samples and the evolution of the Chern number under the variation of disorder strength for dirty samples. As shown in Fig. 6, the system has the Chern number of quantized C = 1 in the range of t < |B| < 3t for clean samples. This region corresponds to the Chern insulator. In addition, the Chern number changes from C = 0 to C = 1 with the increase of disorder strength W for |B| > 3t. Thus, the topological Anderson phase transition (the normal insulator to the Chern insulator) is confirmed, which agrees with the result obtained through the convolutional neural network. Finally, the region of |B| < t that we most concerned corresponds to the second-order topological insulator. However, no quantization is obtained by increasing W in this region, and the area of quantization is much smaller than the result achieved through machine learning [the Chern insulator region in Fig. 5(b)].
In order to explain this contradiction between the two different methods, we make a brief discussion below. We present the wave functions corresponding to the two marked points (square and circle) in Fig. 6. Figure 7(a) shows the wave function distribution for the square symbol marked in Fig. 6, where the Anderson disorder turns the normal insulator into the Chern insulator. It has obvious edge state characteristics. Even though the wave function distribution is much different from that in Fig. 3(a), the neural network can still present a correct result. However, the wave function [see Fig. 7(b)] still holds the feature of edge state, although the Chern number at this point is not quantized [marked by circle in Fig. 6]. This means the edge state without quantized Chern number is not the edge state of the Chern insulator, though the wave function distribution is very similar to the Chern insulator one.
In order to explain the emergence of the edge state distribution, we investigate the wave function distribution in detail [see Figs. 8(a)–8(c)]. In the previous calculation, each wave function distribution was plotted by the average of 40 individual wave function distribution diagrams. In Figs. 8(a)–8(c), we directly take only one disordered sample and get the wave function distribution. By randomly resetting the disorder for three times, we get three random wave function distributions. It can be seen that there are bound states in Figs. 8(a)–8(c), which are all at the boundary, although they are not at the same position. This result means that the position of the bound states at the boundary can be random. We use dashed red lines to duplicate the bound states of Figs. 8(b) and 8(c) into Fig. 8(a). Obviously, the region of the bound state at the boundary will be larger than the result of any one graph after averaging the sum of the three graphs. Therefore, if more are added during the calculation, there is a high probability of producing a wave function distribution similar to that of the edge state of the Chern insulator. This is the most likely reason for misjudgment.
In order to confirm whether there is a phase transition, we use the self-consistent Born approximation method[38,39] to calculate the evolution of the bulk gap with disorder. The self-consistent Born approximation is based on the following formula:
where Σ
d is the self-energy caused by disorder,
W is the disorder strength,
EF is the Fermi level, and
H(
k) =
HTI +
HΔ +
HZ. The parameters are the same as those in Fig.
7 with
B = 0. By solving Eq. (
13), we obtain Σ
d and the renormalized Hamiltonian Σ
d +
H(
k). Based on the renormalized Hamiltonian, the evolution of the bulk gap versus disorder
W is obtained, as shown in Fig.
8(d). The bulk band gap increases at the beginning and then decreases. However, the bulk gap exists during the whole process, which means that the system does not undergo a phase transition. Thus, we infer that figures
8(a)–
8(c) are still in the higher-order topological insulator phase. The reason for the shift of the bound states in space may be that the corner state of the higher-order topological insulator is very sensitive to the localized on-site energy. Due to randomness of disorder, the on-site energy of a certain position is higher and the on-site energy of another certain position is lower, so the bound state will move in space, thus forming the effect of edge state after averaging the individual wave function distributions.
5. ConclusionIn summary, we studied the disordered phase transition of the magnetic second-order topological insulators using the deep learning method. We found that the second-order topological states are stable when the disorder strength satisfies W ≤ 3.5t and the neural network can accurately judge the topological Anderson phase transition from the normal insulator to the Chern insulator. However, by comparing the Chern number under disordered conditions through the non-commutative geometry method, we found that although the neural network predicts a transition from the second-order topological state to the Chern insulator, such phase transition does not happen actually. This anomaly originates from the fact that for the second-order topological insulator, the bound states randomly appear at the edge of the sample under the strong disorder. After the ensemble average, the averaged bound states make the wave function distributions capture the edge state feature, while it is absent for the individual samples.